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,  ABSTRACT 

S  This  paper  studies  how  to  identify  hidden  factors  in  multivariate  time 
series  process.  It  is  shown  that  the  number  of  factors  must  be  equal  to  the 
rank  of  both  the  covariance  matrices  and  the  parameter  matrices  of  the 
infinite  moving  average  representation  of  the  process*  A  canonical 
transformation  is  derived  which  can  recover  such  factors.  The  method  is 
illustrated  with  several  examples.  Or ^  ri fUrlt, 
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SIGNIFICANCE  AND  EXPLANATION 


A  central  problem  in  the  study  of  multivariate  data  is  the  reduction  of 
dimensionality.  This  problem  is  specially  acute  in  modeling  multiple  time 
series  because  if  the  number  of  series  is  large,  a  huge  number  of  parameters 
may  be  needed  to  obtain  an  adequate  representation  of  the  behavior  of  the 
process.  This  paper  explores  ways  to  reduce  the  dimensionality  of  the 
observed  process  through  the  extension  of  the  static  factor  analysis  model  to 
the  dynamic  context.  It  is  shown  how  the  number  of  factors  can  be  identified, 
how  the  loading  matrix  can  be  estimated  and  how  a  canonical  transformation  can 
be  built  to  recover  the  factors  and  to  obtain  a  simpler  representation  of  the 
process.  The  usefulness  of  this  t  is  illustrated  with  several  examples . 
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A  FACTOR  ANALYSIS  FOR  TIME  SERIES 


Daniel  Pena*  and  George  E.  P.  Box 

INTRODUCTION 

A  central  problem  In  the  study  of  multivariate  observations  is  the  reduction  of 
dimensionality.  In  the  time  domain  study  of  vectors  of  time  series,  Quenoulli  (1957) 
suggested  how  to  use  the  eigenvectors  of  the  parameter  matrices  of  an  autoregressive 
proceas  to  obtain  a  simpler  interpretation  and  the  system.  Box  and  Tiao  (1977)  showed  how 
to  build  a  canonical  transformation  of  an  autoregressive  vector  process  which  order  the 
components  from  least  to  most  predictable,  and  Rainael  (1983)  presented  how  to  estimate 
index  variables  to  simplify  the  representation  of  a  vector  autoregressive  time  series  using 
a  formulation  previously  suggested  by  Sargent  and  Sims  (1977). 

Brillinger  (1981)  extended  the  standard  principal  congxanent  approach  to  the  frequency 
domain.  Related  work  is  due  to  Priestley  (1981)  and  Subba  Rao  (1975).  Geweke  (1977)  and 
Geweke  and  Singleton  (1981)  discussed  a  frequency  domain  version  of  factor  analysis. 
Finally,  Hannan  (1981)  has  presented  methods  to  estimate  the  dimension  of  a  linear  system. 

In  this  paper  a  factor  analysis  of  time  series  in  the  time  domain  is  proposed.  It  is 
assumed  that  an  observed  k-vector  of  time  series  could  be  written  as 

5t  *  *  *t  n'1) 

where  P  is  a  k  x  r  matrix  of  unknown  parameters,  is  an  unobservable  r-dimansion 

vector  proceas  and  is  a  k-dimensional  white  noise  sequence  with  full  rank  covariance 

matrix  £e.  This  model  is  of  course  only  relevant  either  if  r  <  k  or  if  r  ■  k  but 

■  0.  In  the  first  case  a  reduction  of  dimensionality  could  be  achieved  without  loss  of 
information.  In  the  second  case,  r  “  k,  an  interesting  problem  is  to  find  out  a  linear 
transformation  of  the  aeries  that  allows  a  simpler  representation  of  the  system. 
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In  Section  2  the  general  aodel  (1.1)  is  analysed  and  eoae  of  its  properties  are 
studied.  A  canonical  transformation  can  then  he  introduced  to  recover  the  factors.  In 
Section  3  the  one  factor  eodel  is  studied  in  detail,  and  in  Section  4  a  practical 
methodology  to  apply  these  procedures  is  developed.  Section  5  includes  several  examples  of 
the  application  of  the  suggested  models,  and  in  Section  6  this  methodology  is  cohered  with 
related  approaches  in  the  time  domain.  Finally,  Section  7  contains  some  concluding 


2.  THE  GEHEBAL  PHCOOTLE  FACTORS  HOPEI. 

2.1  Formulation 

Let  Lt  be  a  vector  of  k  tine  series  and  st  -  Lt  -  Pm  the  vector  of  deviations 
from  aoae  origin  u(  that  will  be  the  aean  if  the  series  are  stationary.  We  assume  that 
these  series  are  generated  by  r(r  <  k)  factors,  yt,  plus  a  aeaaureaent  error  ct  as 

«t  -  Rt  *  St  <2,1) 

where  P  is  a  k  x  r  aatrlx  of  paraaeters  of  rank  r  and  gt  ia  a  white  noise  sequence 
with  full  rank  covariance  aatrlx  •  B>e  vector  ^  follows  a  r-dimensional  ASHA 
(Py,qy)  process  of  the  forat 

«  VB)*t  (2*2> 

where 

ty(B)  •l  -  -  ...  -  |y(P)Bp 

8y(B)  -  |  -  8^1)8  -  ...  -  6y(q)Bq 

are  aatrlx  polynoaiala  in  the  backshift  operator  B,  the  |'s  and  the  J's  are  r  x  r 
aatricee  and  the  roots  of  the  deteralnantal  polyncaial  }^(B>  |  are  on  or  outside  the  unit 
circle,  whereas  those  of  Ig^B)!  are  all  outside  the  unit  circle.  Also,  {a,.}  lea 
sequence  of  vector  Gaussian  white  noise  with  sero  aean  and  covariance  aatrlx  1^.  lie  shall 
asstaa  in  this  section  that  the  r  factors  are  independent,  and  all  the  £  and  £ 
aatrices  will  be  diagonal.  However,  we  will  allow  contemporaneous  dependency  in  the  noise 
aatrlx  and  assuae  only  that  is  positive  definite.  In  Section  4  this  aodel  will  be 

generalised  to  the  case  in  which  the  factors  have  dynaaic  dependence  and  the  paraaeter 
aatrices  |y  and  2y  ar*  longer  diagonal. 

The  aatrlx  P  will  be  called  the  factor  loadings  or  factor  weights  astrlx,  and  its 
elements,  Pj,j»  represent  the  weight  of  the  factor  j  in  the  observed  ith  component.  If 
5  is  any  r  x  r  nonsingular  aatrlx,  the  generating  equation  could  also  be  written 

*t  “  *  St 

where  g#  -  Pjj"1  is  the  new  rectsngular  matrix  of  coefficients  and  y*  - 
transformation  of  the  factors.  Multiplying  (2.2)  by  Ji 


is  a  linear 


Cfr(B)C~1CyL  »  ce<B)C_1Cat 


and  ao,  the  model  for  the  new  aet  of  factors  is  again  an  r-dimensional  ARMA  <Py,qy)  model 
with  parameters 

$*<B)  -  C^(B)C-1 
8*(P)  -  ce(P)C-1 

£*  -  CZ  C* 

■a  --a* 

In  general,  the  restriction  that  the  £  and  6  parameter  matrices  of  the  components  are 
diagonal  imposes  a  uniqueness  condition.  However,  if  £(B)  “I  -  IB  and  6(B)  «  J ,  that 

is,  the  factors  follow  and  nonstationary  ARID  with  all  the  roots  in  the  unit  circle, 

some  indeterminacy  appears,  because  whatever  the  £  matrix  the  new  factor  will  be 
uncorrelated  at  all  lags.  A  possibility  is  to  choose  C  to  diagonalise  £*£  and  at 

the  same  time.  In  this  way,  not  only  the  factors  are  uncorrelated  to  all  lags  and 

contemporaneously ,  but  also  the  columns  that  transmit  their  effect  to  the  observed  series 
are  orthogonal. 

However,  if  C  is  diagonal,  that  is  can  be  interpreted  as  a  change  of  scale,  the 
transformed  parameter  matrices,  |*(B)  and  0*(B)  are  always  diagonal  and  £•  keeps  its 
basic  features.  To  remove  this  source  of  indeterminacy  we  assume  that  the  columns  of  the 
P  matrix  of  factor  loadings  are  such  that 


l 

i-1 


i  j 


1 


The  objective  of  the  analysis  will  be  to  estimate  these  factor  loadings  and  to  build  a 
canonical  transformation  to  recuperate  the  hidden  factors. 


2.2  The  Covariance  and  Correlation  Matrices  of  the  Observed  Process 

Let  us  call  r^Jk)  »  E the  covariance  matrices  of  the  process  ft  and  £y(k) 
the  covariance  matrices  for  the  generating  vector  .  Then: 

rz(°)  -  P|M0)P'  +  £e  (2.3) 


-4- 


-  fiyOOf'  k  >  1  (2.4) 

and  tha  rank  of  r#(k)  for  k  >  1  will  ba  equal  to  tha  number  r  of  coaaor.  factors. 
Also,  if  tha  factors  ara  independent  for  all  lava,  and  tha  matrix  is  diagonal,  than 
all  tha  covariance  aatricaa  r^(k)  will  ba  diagonal  and,  hancat  (1)  tha  aatricaa  £((k) 
will  ba  ayaaatrlc  for  k  >  1,  (2)  tha  columns  of  g  will  ba  aiganvactora  of  £#(k)  with 
eigenvalues  y^k),  whara  y^k)  ara  tha  diagonal  elements  o*  £ytJO- 

The  partial  autoeorralation  aatricaa  will  ba  givan  by  (aaa  Tiao  and  Box  (1981)) 

£1(°)£«(i)  i  -  i 

P'(i)  -  trs(0)  -  b'U^-’ujbUM"1 

[£t(i)  -  bMtJ^’u)  _  (i)l  i  >  1 


b'  ( t)  -  (P(i  -  1) P(i  -  2)  ...  £'{1)] 

e'U)  -  trvi>r>(2)  ...  r*<i  -  nj 


..  r*(t-2)  1 


£(i-2) 


and  ao,  tha  partial  autoeorralation  aatricaa  can  ba  written 

1-1  i-1 

PMi)  -  |(i)[£  (l)  -ll  P(i  -  DjfV.j.Or  (j)J 
i-1  j-1 

whara  B(t)  -  [£^(0)  “  b*  (t)A  '(t)b(t)]  1  and  A  '(1,3,1)  ia  tha  (i,J)  elaaant  of 
A-1(t).  than 

i-1  i-1  , 

P*(i)  -  *(i)P(r  (i)  -  l  l  £Mi  -  DP’ft"  (i,j,i)Pr  (j))P*  (2.5) 

^  i-1  j-1  ^  ^ 

Mow,  if  wo  call  v  tha  null  apaca  of  P  defined  by  all  tha  vectors  that  verify  J'v  -  0 


•*  -S  -  #  /*  ,  »  ,  -  _  «  .  •  ,  »  ,>  «>  ,  *  »  «  _  « 


and  if  V  is  a  k  x  (k  -  r)  matrix  which  columns  are  a  basis  for  this  space  v,  then 


P' <t)V  -  -  0 

this  shows  that  (1)  the  rank  of  P'U)  is  at  most  r,  and  (2)  the  eigenvectors  linked  to 
zero  eigenvalues  of  P’(t)  belong  to  the  v  space. 

As  an  example,  and  for  further  reference,  we  derive  the  first  partial  autocorrelation 
matrix  assuming  that  the  factors  follow  an  AR{1)  process  with  parameter  matrix 


p < i )  -  r •  { i »r-1  to »  -  pr*cnp'(pr  «j)p*  +  z  )-1 

■  **  »z  mmy  m  mmy  m  •€ 


and  using  the  relations 


<Ie  +  2£o<y>r>_1  -  C-  +  V0,£,^1s)"l£y(0,2’5e"1 


*  -  r*(i)r“  to) 
*y  -y  -y 


and  calling  A  *  P 1 1 P ,  we  obtain 


Pd)  -  p*  r  (old  -  a(i  +  r  (o)A)-1r  (O)jp'rT1 

■  mmymy  m  mm  my  m  my  m  “C 

now,  using  the  relations  (Handerson  and  Searle  (1981)) 


(I  +  P) 


I  -  P(I  +  P)*1  -  I  -  (1  +  P)_1P 


(i  +  £2)-12  *  E (l  *  8E1"1 


we  obtain 


Pd)  -  per  (Old  ♦  Ar  tojrViT1 

m  Myay  m  >.y  a  ag 


The  matrix  Ar  (0)  can  be  considered  as  a  measure  of  the  ratio  between  the  signal 

»«y 

introduced  by  the  factors  in  the  observed  Beries  and  the  noise  due  to  efc.  For  instance, 

2 

in  the  one  factor  case,  assuming  “  oeI,  then 


AT  (0) 

—y 


where  are  the  components  of  the  vector  P  and  Yy<0)  the  variance  of  the  factor. 

The  above  expression  shows  that  a£  (0)  compares  the  variance  of  every  component  of  zt 

2  2 
due  to  the  factor,  p,y  (0),  with  the  variance  introduced  by  the  error,  a  . 


In  tht  gtatril  oaNf  it  this  Matrix  is  anoh  grsatsr  than  J  (that  l*,  ths  signal 
aoiss  ratio  is  larger  than  oaa)  than 

(J  ♦  ■  | 

and  if  m  Multiply  (2.6)  by  £ 

£(1,l  *  ttyiy(0)(I  +  SEy<0>,-1ft  *  Sty 

and  ao,  approxiaataly ,  P(  1 )  baa  aiqanwaluaa  aqual  to  tha  diagonal  alauanta  of  w 

corraapondlnq  aiqanractors  tha  coluana  of  P. 


r  tha  Ob  aa  ft  ad  Prooaaa  and  Zta  Paraaatara 


tha  propartiaa  wa  hara  atudiad  for  tha  oovarlanca  and  oorralation  uatrioaa  ixpoaa 
atronq  restriction  on  tha  typa  of  nodal  tha  observed  ;t  v actor  of  sariaa  can  follow, and 
to  tha  char act ariatica  of  ita  parameter  valuaa • 

Theorem  1.  Suppose  ^  where  yt  ia  a  r-dimensional  MMX  (py,gy)  procaaa, 

£  ia  a  k  a  r  Matrix  (k  >  r)  of  rank  r  and  Jt  ia  a  k-dinanaional  whita  noiaa 
aaqganca  with  coverianea  Matrix  than,  a  follow  a  k-dinanaional  AJMA  (ps,qB> 

*lth  pB  -  Py,  qB  -  Max(py,qy). 


Proof. 


tha  covariance  natricaa  of  an  AHA  (p,q)  nodal  satisfy  tha  aquation  (aaa  tlao  and 


Box  (1981 ) )  i 


£/*>  - 


j-L  V3>*yu  "  3»  -  V3,Wy(J  *l)  1  " 

m 

iyl*  *  *  J>  *  > 


whera  a  •  aax(py,qy),  tha  p's  ara  obtain  ad  fron  tha  relationship  £(■)  “  £  (B)O(B), 

and  *  -J.  It  la  aaausnd  that  (a)  if  p  <  q,  |(p  ♦  1)  ■  •••  “  |(n)  “  0,  and  (b)  if 

q  <  p,  8(q  +  1)  ■  6(a)  -  0.  Praaultiplyinq  tha  ayatan  of  aquation a  by  £  and 
poataultiplyinq  by  £'j 


MSSW?®'' 


1,(0)  -  l  E,(J>t;(-l>  +  £e  +  ££*£’ 

V 

lzit)  -  I  £2<j)i;u  -  j>  -  ijijU)  t  - 
j 

m 

r  (t)  -  I  £  a  -  jjjMt  -  j)  i  >  p 

*  j-i  * 

and  writing 

£e  +  ££a£'  "  * 

j-0 

m-i 

m;(4)  *  i  + 15  4  ■ 1 . p 

j-0 

the  process  will  follow  an  ARMA  (P,P)  model.  In  the  general  case,  if  py  >  qy, 
m  «  py,  the  first  py  covariance  matrices  will  be  given  by  (2.9)  and  (2.10)  and  so  they 
will  not  follow  the  AR  pattern  and  an  MA(p)  component  will  be  needed  in  addition  to 
the  AR(p)  linked  to  the  pattern  of  (2.12).  So,  the  required  model  will  be  ARMA  (p,p). 

On  the  other  hand,  if  qy  >  py  the  first  qy  matrices  will  follow  the  nonsystematic 
structure  and  the  process  will  be  ARMA  (py ,qy) • 

The  representation  obtained  for  ^  is  not  unique,  as  shown  in  the  following: 

Theorem  2.  If  zt  is  an  observed  vector  time  series  generated  by  r  common  factors  as  in 
(2.1)  and  which  follows  model  (2.2),  then  *  could  be  equally  represented  as 
|*(B)rt  -  8*(B)at  with  new  parameters  ft*  -  +  ft,  8 *  -  9^  +  ft,  where  ft  is  a  nonnull 

matrix  of  rank  ■  k  -  r. 

Proof . 

Let  m  «  max(p,q),  then,  assuming  8(q  +  1)  -  ...  •  6(m)  «  0  if  m  >  q  and 
9(p  +  1)  «  ...  «  9(m)  “0  if  m  >  p: 

8*(B)  -  (I  -  (0,  +  A)B  -  ...  -  (Sjh  +  ft>Bm)  -  6(B)  +  AI^B) 

**(B)  -  *(B)  +  AI  (B) 

*  m 


and  the  condition  for  the  matrices  8*<B>  and  $*(B)  to  be  equally  acceptable  as 


parameter  matrices  of  the  process  is 


$(B)*t  +  ftJ^B)^  "  $(B)*t  +  M„(B)at 

and  so,  the  condition  is 

a*t  “  tit  (2.13) 

and  there  must  be  k  -  r  linear  combination  of  the  series  that  are  random  noise.  Suppose 
that 

5t  “  +  £t 

if  we  multiply  for  any  matrix  V*  such  that  V'P  -  0,  then  by  Sylvester's  law  of  nullity 
rank(V)  <  k  -  r,  ands 

Y'£t  "  Y'£t  (2.14) 

now  we  prove  that  any  matrix  V  that  verifies  (2.14)  must  verify  (2.13),  as 

5t  "  i(B)-1e(B)at  -  $(B)at 

-t  ■  i(B)*t "  »t 

and  if  V'P  -  0 

V«£t  -  y4(B)at 

and,  as  V’e^  must  be  a  white  noise  sequence, 

V'e  -  V'e.  (2.14) 

“  ■*t  “  “t 

V'^  -  0  V  i  >  1  (2.15) 

and  so,  if  we  take  A  «  V'  the  proof  is  complete.  Note  that  there  always  exist  a  matrix 
V'  such  that  V'P  -  0.  we  can  take,  for  example,  as  first  k  -  r  rows  of  J'  a  set  of 
the  k  -  r  lineary  independent  eigenvectors  of  PP'  that  are  linked  to  sero  eigenvalues, 
and  put  in  the  leftover  r  positions  any  r  vectors  of  the  linear  subspace  generated  by 
the  first  k  -  r  rows. 

An  important  conclusion  from  Theorem  1  is  that  the  autoregressive  matrices  of  the 
multivariate  observed  process  zt  must  satisfy  (2.8)  and  so 

*z(*>e  ■  ey*>  (2-i6) 

which  has  the  general  solution 

iE(k)  “  Ely(k)E~  +  £(I  ”  S£_)  (2.17) 

where  P  is  any  generalized  inverse  of  P  that  satisfy  PP  P  ■  P  and  C  is  any 


arbitrary  matrix  with  tha  only  restriction  that  the  root*  of  ||8(B)|  ara  on  or  outside 
tha  unit  circle.  As  tha  matrix  4y(k)  is  diagonal,  aquation  (2.16)  shows  that  tha  columns 
of  P  ara  eigenvectors  of  with  eigenvalues  tha  diagonal  elements  of  fcy(k). 

However,  the  matrix  |8<k)  can  have  any  rank,  due  to  tha  presence  of  the  arbitrary  matrix 
C.  On  the  other  hand,  according  to  Theorem  2,  for  every  possible  solution  £s(k)  there 
will  be  a  set  of  8^(k)  matrices  that  must  verify  restriction  (2.15).  For  example,  if 
Py  ■  1,  qy  “  0,  and  the  vector  of  components  follows  a  multivariate  AR(1)  process,  the 
system  of  equations  to  determine  the  parameters  is  obtained  as  a  particular  case  of  (2.10) 
to  (2.12)  after  seme  straightforward  algebra,  to  be 

It  ♦  Oaf  -  *u  -  ♦  UuSi  (2-18) 

5ei;  -  U;  <2-i9> 

M  "  Sty 

and  any  set  of  matrices  t  ,  8^  that  satisfy  this  system  is  a  solution  of  the 

syetem.  Although  there  are  infinite  solutions,  they  can  be  characterised  as  follows. 

First,  *  is  given  by  (2.17)i 

™Z 

t»  ”  £iy£“  +  "  »‘> 

and,  given  4  ,  we  have  in  (2.18)  k(k  +  1)/2  equations  (because  of  its  symmetry)  plus 
z 

k2  in  (2.19),  that  is  equal  to  the  number  of  unknown  parameters.  Furthermore,  using 
result  (2.14) 

2**1  -  2}(te  -  *  2  <2-2” 

which  implies  that,  although  4  and  6  can  have  any  rank,  their  difference  must  have 

*z  ®z 

rank  no  greater  than  the  number  of  components  in  the  system.  Also,  suppose  that  h  is  an 
eigenvector  of  the  matrix  ^  -  8^,  then  (+*  -  8^ )h  «  Xh,  and  assume  that  X  is 
nonzero.  Then,  if  v*  is  any  row  of  J  as 

“  2,)$  “  Xvjh  -  0 

and  h  must  be  orthogonal  to  the  subspace  generated  by  the  matrix  g,  and,  therefore, 
belongs  to  the  subspace  generated  by  P. 

In  sumstary  when  building  a  multivariate  time  series  model  for  a  vector  of  eeries  *t 
that  ia  driven  by  a  number  of  common  factors  as  in  (2.1)  it  may  appear  that  a  complicated 


Model  with  all  kind*  of  dependency  ia  needed.  In  particular  we  may  finish  with  a  model 


showing  feedback  among  all  the  coaiponents  of  the  vector  st.  However,  the  parameter 
matrices  in  this  case  must  satisfy  many  restrictions.  There  may  be  summarise  as  follows: 
(1)  all  the  autoregressive  matrices  will  have  r  contaon  eigenvectors  that  define  the 
subspace  |  of  the  factors  and  (2)  all  the  |  matrices  must  have  rank  equal  to  r  and 
their  columns  must  belong  to  the  subspace  generated  by  P. 


2.4  A  Canonical  Transformation 

An  important  problem  is  to  find  a  transformation  of  tha  observed  series  zt  that 
allows  us  to  recover  the  factors.  To  sxplain  how  to  find  such  a  transformation  first,  note 
that,  if  P-  is  any  generalised  inverse  of  £,  if 

=t  '  s*t  + 

then 


second,  suppose 
then 


-  £  £t  -  S  £t 

that  we  apply  a  linear  transformation  M  to  where  M 


P  a.  y  ♦  P  t.  x 

X  -  Hr  -  [ t\  -  [  "  _t  I  -  [-1t] 

*t  LB«t  i  lBPxt  +  KtJ  lX2tJ 


To  recover  the  r  factors  5  should  be  chosen  such  that  BP  •  0.  Then,  the  first 

r  components  of  xfc  will  be  equal  to  the  vector  of  common  factors  plus  some  added 

noise,  and  the  second  k  -  r  components  will  be  just  white  noise.  A  matrix  B  that 
satisfies  this  conditions  may  be  obtained  putting  as  rows  of  B  the  k  -  r  eigenvectors 
linked  to  sero  eigenvalues  of  PP*.  Calling  this  eigenvectors: 

PP’  [v1 , . . .  ,Vjt_rI  -  0  -  PP'V 


that  implies. 


n  -  & 

and  so  B  •  V  has  the  desired  property.  Although  the  selection  of  P~  could  be 
arbitrary,  it  seems  sensible  to  choose  a  generalised  inverse  that  leads  to  canonical 


variables  as  staple  as  possible.  A  convenient  property  is  that  the  g1  and  x2 
coaponants  of  the  new  vector  be  independent,  but  this  is  not  in  general  possible  unless 
£g  -  X.  Assisting  this  equality,  them 

*l5it52tJ  *  *■£.*•  -  ir 

and  if  ve  choose  j'  as  the  Moore-Penroe  generalised  inverse  of  J,  given  by  (£'1)  V' 
the  covariance  matrix  will  be  zero  and  the  coaponants  will  be  independent. 

to  illustrate  this  transformation  1st  us  suppose  that  follows  an  r-diaensional 

ARM)  process,  then  gt  is  AAKA(1,1)  and  so  is  any  linear  transformation  of  gt. 
However,  the  parameters  of  the  k-dimanslonal  process  given  by  xt  ■  g£t,  where i 


will  be 


tx-a tji 

lx*  *■*“ 


li  1  si 


as  can  be  verified  by  direct  multiplication  Jflf 1  -  {{“’{J  -  |.  Ho, 

|  <ri>“Yw  I 


and  using  the  general  expression  (3.20)  for 


In  the  same  way,  calling  6Q  the  value  of  associated  with  ^  *  wh,n  ft  “  2 

then,  for  Theorem  2,  the  general  expression  of  is 

U  -  So  +  -  SS‘» 


and  so 


<s*s>_1s’a0s  <s‘i>-V2,x 


where  we  have  usad  that,  for  (2.21)  -  X'®,*  ®°»  calling  x£  “  l5}t'-2t^  *nd 

Into  account  that  tha  addition  of  MA  process  Is  a  new  HA  process i 


<J  -  “  <J  -  2XB»B1t 


-2t  "  -2t 


that  allows  us  to  recover  the  components  mixed  up  with  noise  in  x.t . 


2.5.  Some  Simulation  Results 

We  present  here  two  simulations  to  show  how  the  above  theory  can  be  expected  to  work 
in  practice.  The  results  we  show  are  representative  of  the  many  simulations  we  have  made . 


A  sample  of  100  observations  of  a  vector  of  4  time  series  was  generated  according  to  the 


.8  .2 


:i  “  (&  *-) 


with  E£  -  1,  Vjjt  »  afc  and  £a  “  t0  1  5J  Table  la  shows  the  eigenvalues  of  the 

covariance  and  correlation  matrices  and  the  eigenvectors  linked  to  the  nonsero 

eigenvalues.  To  explain  these  results,  first  note  that  as  the  two  factors  are 

nonstationary  their  sample  variances  are  much  larger  than  the  diagonal  elements  of  I£  and 

so  T  (0)  has  a  very  similar  structure  to  T  (k)  for  k  >  1.  This  means  that 
z  i 

r  <t)  «  pr  <i)p'  v  t  . 

-z  ««y  ■ 

Second,  as  the  factor  are  completely  independent  and  with  similar  variances, 

T  (i)  =<  y  (1)1,  where  Y  (1)  ia  a  covariance  scalar  function.  Then 

sty  y  r  y 

r  (i)  a  Y  (l)PP'  v  1 

»y  y  — 

and  the  eigenvalues  of  r^(i)  are,  approximately,  proportional  to  the  eigenvalues  of  gg> 
and  with  the  same  eigenvectors.  The  two  eigenvectors  of  PP’  linked  to  nonsero 


•  v-v-w. 


•\ 


covariance  matrices,  and  (3)  p< 2)  ia  amall  and  nearly  aero,  These  raaulta  suggest  that 
the  aodel  for  the  comaton  factora  la  probably  ar<1)  and  confira  mildly  the  hypotheaia  of 
two  factora  leading  the  ayatea. 


TAB!*  1b 

Partial  Autocorrelation  Matricea 


Note  PU)  >0  for  t  >  2 

Table  2  ahowa  the  eigenvaluea  of  the  eat lasted  parameter  aatricea  and  of  ita 

differency.  Note  that,  aa  *  I,  then 

*  P  -  PI  -  P 
—  ” 

and  for  any  nonalngular  aatrlx  C 

ijs-ss 

and  ao,  any  linear  tranaforaation  of  the  coluana  P  my  appear  aa  eigenvaluea  of  The 

rank  of  the  difference  £  -  8  la  nearly  2,  which  confirm  the  exiatence  of  two  factora. 
Note  that  the  two  largeat  eigenvaluea  of  are,  in  thia  caaa,  similar  to  the  eigenvalues 

of  the  covariance  matrices. 

Taking  this  two  eigenvaluea  as  factor  loadings  to  build  the  transformation,  the  new 
vector  of  transformed  aeries  will  be  given  by 


P(1) 


.97 

.87 

.19 

.10 

.65 

.27 

-1.39 

-.28 

.68 

.65 

.76 

.51 

.66 

-.35 

-.89 

-.62 

.50 

-.65 

.52 

.52 

P<2) 


-.14 

.21 

.044.091 

-.57 

.30 

.62 

.60 

complex 

.11 

-.30 

-.41 

-.86 

74 

-.58 

.24 

-.24 

37 

-.06 

-.74 

.56 

42 

.62 

.01 

-.19 

29 

.36 

.55 

.57 

and,  using  thia  transformation,  the  new  series  x^  » 


ast  will  have  parameters 


TABLI  2 


Eigenvalues  of  thj  Istlmatsd  Htwtit  Matrices 


r  ' K 

'  V  V* 

-  TT «  *. 


1.0  .90  -.77  .20 

.64  .16  .02  -.IS 

.97  .15  .41  .96 

-.22  .73  1.03  .17 

-.67  .87  -.30  -.14 


-.99  -.14  .28*. 091 

.27  .43 

.58  1.41  - .  — 

.75  .33  eo*pl“‘ 

-.12  .16 


.96  .73 


.25  .06  I 


.42  .31  -1.15  .71 

.72  .46  0  .67 

.50  -.42  -.02  -.43 

.22  -.69  .29  .82 


with  Af  *  t]  hu  rank  equal  to  too.  So  m  will  obtain  two  random  component  plua  the  two 
stationary  factors  contaminated  with  some  white  noise /  as  expected. 

The  main  conclusions  from  this  exercise  are  as  follows i  (1)  Although  the 
determination  of  the  number  of  factors  using  the  rank  of  the  matrices  may  be  not  clear  for 
soma  matrices,  putting  together  all  the  different  pieces  of  information  that  can  be 
obtained,  the  picture  is  normally  quite  clear.  (2)  The  factor  loadings  can  be  obtained 
with  reasonable  accuracy  from  the  eigenvectors  linked  to  large  eigenvalues  of  the  6 
parameter  matrices. 

CASS  2 

The  generating  equation  and  the  sample  sice  in  this  second  example  are  the  same  as  in 
Case  1.  However,  now  only  one  factor  is  nonstationary  and  it  has  been  allowed  some 
correlation  aaung  both  components .  The  factor  time  series  model  is 

u-i-;  i>)  53 

Table  3a  diaplays  the  eigenstructure  of  covariance  and  correlation  matricas  and  Table  3b 
that  of  the  partial  autocorrelation  matrices. 


fW' 


JL 


Jil 


X 


a)  Largest  eigenvalue  and  its  eigenvectors 
Covariance 

Tag  I  0  1  2  3  4  ~ 


Correlations 


0 

1 

2 

3 

4 

5 

0 

1 

2 

3 

4 

5 

54.10 

44.63 

40.66 

35.87 

32.46 

30.74 

2.98 

2.2 

1.9 

1.6 

1.4 

1.3 

.22 

.48 

.17 

.15 

.14 

.13 

.53 

.43 

.40 

.35 

.32 

.28 

.15 

.08 

.05 

.02 

.02 

.02 

.43 

.25 

.16 

.07 

.06 

.04 

.63 

.63 

.63 

.63 

.63 

.62 

.57 

.61 

.63 

.64 

.64 

.64 

.73 

.75 

.75 

.76 

.76 

.77 

.51 

.61 

.65 

.68 

.69 

.71 

b)  Second  largest  eigenvalue  and  its  eigenvector 


TABLE  3b 

Eigenvalues  and  Eigenvectors  of  the  Partial  Correlation  Matrices 


.68  -.29 

.52  -.23 

.17  -.79 

35  1.13 


Cosiplex 


Th*  aain  conclusion*  of  both  autocovsrlanc*  and  autocorrelation  Mtrioss  ars  that  (1) 
thar*  are  probably  two  cowon  components*  (2)  on*  ia  such  aor*  important  than  th*  othari 
(3)  on*  saw*  to  b*  nonatationary  taking  into  account  th*  slow  decreasing  of  th*  largest 
eigenvalues  >  whereas  the  other  a  seats  to  b*  stationary. 

It  ia  important  to  sea  th*  differences  with  raspect  to  Case  1.  If  we  compare  Table  la 
with  Table  3a  it  can  be  seen  that  the  second  largest  slgsnvalu*  in  Table  1  decreases  slowly 
and  th*  eigenvector  is  fairly  stable  even  for  lags  as  far  as  five.  In  contrast  with  this, 
th*  second  eigenvalue  of  Table  3a  decreases  quickly,  and  is  clearly  sero  at  lag  five,  as 
shown  not  only  but  its  nail  value  but  as  wall  by  th*  change  of  structure  of  its  linked 
eigenvector. 

The  partial  autocorrelation  Mtrices  show  that;  as  far  as  the  rank  of  these  Mtrlces  is 
concern e<V  the  observed  results  can  differ  from  theory.  Th*  only  clear  indication  in  this 
css*  is  that  th*  general  pattern  of  th*  eigenvectors  linked  to  th*  two  largest  eigenvalues 
is  not  in  contradiction  with  th*  previous  results. 

Table  4  presents  th*  eigenatructur*  of  the  estiMted  parameter  Mtrices.  As  in  the 
previous  simulation  th*  final  estiMted  modal  has  many  significant  coefficients  both  in  th* 
autoregreesiv*  and  moving  average  Mtrices,  and  it  Maas  to  repreMnt  a  quite  complex 
relationship  among  th*  series.  However,  th*  Mtrix  |  -  8  has  two  larger  eigenvalues 
similar  to  those  of  P(1). 

If  we  accept  th*  hypothesis  of  two  Min  factors  and  choose  the  two  largest 
eigenvectors  of  |  to  define  th*  Mtrix  of  factor  loadings  (,  the  resulting  paraMter 
Mtrices  for  th*  xt  -  M*t  aeries  can  be  decomposed  as  befors  as  th*  si*  of  two  Mtrices, 
on*  of  which  cancels  in  both  term*.  Th*  simplified  paraMter  Mtrices  arei 


showing  on*  nonstationary  factor  and  a  second  stationary  factor. 


3.  THE  ONE  FACTOR  MODEL 


3. 1  Introduction 


Hie  one  common  factor  model  la,  on  the  one  hand.  Important  In  Its  own  right  because  it 
describes  the  relationship  of  several  measurements  over  time  of  the  same  dynamic 
variable.  On  the  other  hand,  this  situation  is  going  to  appear  approximately  in  the 
multifactor  situation  when  one  of  the  factors  has  much  greater  variance,  and  for  this 
reason  is  more  important  than  the  others. 

We  will  assume  in  this  section  that  in  the  general  representation  (1.1)  P  is  now  a 
k  x  1  vector.  He  will  concentrate  here  in  the  case  that  yt  follows  an  AR(1)  process 
and  will  analyse  the  behavior  of  the  system  as  the  parameter  of  yt  approaches  the 

unit  value. 

With  this  set  up,  every  component  rlt  of  jt  will  follow  an  ARMA  (1,1)  model  with 
the  same  parameter  ,  and  with  parameters  8^  that,  as  can  be  seen  equating  moments 
must  verify: 

ei*y  “  VUl  +  1  +  ♦  ♦y  -  0  «3-1> 

2  2 

Pi°a  2 

where  y^  “  — ,  being  p^  the  ith  component  of  P,  the  variance  of  the  noise  in 

0i  2 

the  model  for  yt  and  the  variance  of  the  ith  component  of  the  measurement  error 

vector  It  can  readily  be  seen  from  (3.1)  that  the  parameter  0^  in  each  series 

depends  on  y^ .  If  y^  ♦  »,  which  means  that  the  measurement  error  can  be  ignored,  0  ^ 
tends  towards  sero,  and  if  yi  ♦  0  everything  is  dominated  by  the  noise  and  0 ^  , 

which  means  that  this  component  will  be  white  noise. 


3.2  The  Covariance  and  Correlation  Metrical 

Calling  yy(k)  to  the  autocovariance  coefficient  of  order  k  in  aeries  yt, 

^(0)  -  PP*Yy(0)  +  re 
r£(k)  -  pp'ry(k)  k  >  i 

and  for  k  >  1,  (1)  all  the  covariance  matrices  will  he  aymmetricj  (2)  all  will  have  the 

rank  equal  to  one;  (3)  all  will  have  an  eigenvector  equal  to  P  linked  to  the  nonzero 
eigenvalue;  (4)  all  nonzero  eigenvalues  will  be  proportional  to  the  autocovariance  or 
autocorrelation  function  of  y^ . 

So,  in  the  special  case  in  which  yt  will  follow  an  AJU 1 )  the  nonzero  eigenvalues 
of  the  covariance  matrices  must  decrease  exponentially. 

The  partial  autocorrelation  matrices  can  be  obtained  from  equations  (2.5)  and  (2.6). 
The  first  term  is 

P<1)  ”  ♦  Y  COX'*  +  ary(0))-1PP,r1 
where  now  a  -  P'£-1P,  end  so 


P(dp 


♦  (■ 

y 


aYy(0) 

1  +  ary<0) 


J  l 


that  shows  clearly  that  if  the  signal  to  noise  ratio  aYy(0)  is  much  greater  than  1,  the 
first  partial  autocorrelation  matrix  will  have  an  eigenvector  P  linked  to  the  eigenvalue 
♦y.  Note  that  in  all  cases  P  will  be  an  eigenvector  of  F ( 1 >  to  difference  from  the 
multifsctor  situation. 

For  the  others  terms,  using  (2.5) 


1-1  1-1  , 

f(f)  -  p(r  (*>  -  2  l  r  (i  -  i)yv (3)p'a‘  (i,j,i)P>P'*B' d) 
y  i-i  j-i  y  y 


where 


fiU) 


(Yy(0)J|'  +  5e 


1-1  i-1  . 

I  l  vm  cmkV  u,jDgp’) 

i-1  j-1  y  y 


calling 


1-1  1-1 


ai  “  V0>  '  ^  ^  Y  (i)lf  (j)P’A_1(l,j,t)£ 


i“1  3-1 


1-1  1-1 


f,  -  r  <i>  -  l  l  Y  (t  -  i)Y  (j)P'A 
*  y  I-*  4  y  y 


i-i  j-l 


P(l)  -  f.PP'td.PP1  +  I  ) 

-  1“  «e 


and  using  the  expression  for  the  Inverse  of  the  sub  of  two  matrices  one  of  them  being 


nonsingular  (see  Section  2) 


and  so  P(i)  will  have  an  eigenvector  equal  to  p  with  eigenvalue  af ^ ( 1  +  ad^)  .  It 
can  be  seen  from  (3.2)  that,  again,  if  the  signal  to  noise  ratio  is  large,  the  eigenvalue 
linked  to  the  P  eigenvector  of  £(t)  will  follow  the  partial  autocorrelation  function  of 
the  component  yt • 

In  particular,  if  yt  follows  an  AR(1),  £{*)  will  be  near  rero  for  t  >  1  if  the 

signal  to  noise  ratio  is  large.  Otherwise,  £(t)  will  be  different  from  zero  and  will 
have  a  nonzero  eigenvalue  linked  to  P. 


3.3  The  Parameter  Matrices 

The  parameter  matrices  of  the  process  can  be  obtained  equating  the  representation  of 
the  covariance  matrices  implied  by  the  one  factor  model  to  the  covariance  matrices  for  an 
ARMA  (1,1)  vector  process.  Then 

r0  -  ££'Yy<0)  +  lc  -  -  *^9’  ♦  9^2-  (3.5) 

r,  -  pp'y  (i)  -  r.*'  -  i  9*  (3.6) 

-1  —  y  -0-  — u— 

r  -  pp'y  (k)  -  r  k  >  2  (3.7) 

«k  —  y  -k-1- 

where  #,  0  and  are  the  matrices  that  define  the  observed  vector  process  zt,  and  we 

2 

use  4y  and  a #  to  represent  the  parameters  of  the  AR(1)  model  for  the  common  factor 
yt.  Using  (3.7)  for  k  *  2  it  is  readily  seen  that 


V  N.*  N.'  •.* 


/•v v-y.y.>y .  . .  * 


-  .> . 


■I-  .•  ’y.y.y 


s«vv 


(3.6) 


iZ  -  4yP  (3.8) 

and  ao  must  ba  an  eigenvalue  of  |  linked  to  tha  eigenvector  p.  Tha  general 
aolution  of  (3.8)  la  (saa  Rao  and  Mltra  (19?1))i 

•  -  (P‘P)~,»y>P'  +  4^(1  -  (P'P)“1PP‘)  (3.9) 

where  C  la  any  arbitrary  matrix  with  tha  only  raatrictiona  that  tha  aiganvaluaa  of  4 
atuat  ba  on  or  inaida  tha  unit  clrcla.  Eliminating  tha  covariance*  from  (3.3)  and  (3.6) i 


^  -  £e  +  +  HuS*  - 

lei'  ’  tuS' 


(3.10) 

(3.11) 


Bquutions  (3.8),  (3.10)  and  (3.11)  dafina  tha  raatrictiona  which  auat  ba  obayad  by  the 
paraneter*  of  tha  ayataa.  On  tha  whole,  tha  ayataa  haa  not  an  unique  aolution  bacauaa 
there  are  more  unknown*  than  aquationa.  However,  once  J  ia  aaauaad  fixed  according  to 
(3.9)  tha  parameter*  aatricaa  6  and  are  determined  by  (3.10)  and  (3.11).  It  can  ba 

verified  that  tha  aolution  of  theaa  aquationa  iai 


s.  < 

v.’i 


a-i-^TT-XbS's;1 

tu  -  XSE*  +  £e 

whara  •  ia  given  by  (3.9),  b  -  P'E  ^  and  X  verified 


bX2  -  X(bo2  +  42  -  1 >  -  a2 


(3.12) 


(3.13) 


(3.14) 


and  correaponda  to  tha  aolution  with  9  having  ita  aiganvaluaa  on  or  inaida  tha  unit 

2 

circle.  Mote  that  if  4y  ♦  0,  X  ♦  o‘a,  aa  expected.  Equations  (3.9),  (3.12)  and  (3.13) 
auaaarixe  tha  relevant  propertiea  of  the  aatricaa  of  the  proceaai  ( 1 )  £  nay  have  any 
rank  but  nuat  have  P  aa  an  aiganvactor  with  eigenvalue  4yj  (2)  £  nay  have  any  rank 
but  again  auat  have  P  aa  an  aiganvactor  with  eigenvalue i 


"e  i  ♦  xb 

and  uaing  (3.14),  tha  axpraaaion  of  Xg  aa  a  function  of  tha  paraaetera  of  tha  coomon 
factor  iai 


My  •  Xe<b0«  +  *y  *  11  +  *y  “  0  (3-15> 

and  if  we  coopered  (3. IS)  to  (3.1)  is  clear  that  Xg  will  be  similar  to  the  univariate 
parameter  of  the  series,  although  smaller;  (3)  £  -  9  must  have  rank  equal  to  one,  and  p 

will  be  the  eigenvector  linked  to  its  nonsero  eigenvalue;  (4)  the  matrix  ^  will  be,  in 
general,  a  full  rank  matrix. 


3.4  The  Canonical  Transformation 

In  the  one  factor  case,  the  canonical  transformation  is  the  matrix  of  eigenvectors  of 
PP' .  Calling  J}'  this  matrix,  the  covariances  of  the  transformed  process  ■  J}*t  will 
be 

£*<*>  - 

and  so 


rx(k)  - 


k  >  1 


£x<0)  “ 


V*2i 


O' 

2 


+  HI  H* 


and  the  residual  covariance  matrix  will  be 


I  »  Kt  M' 
-x  **u* 


XIp‘ 


+  Ml  M’ 


The  covariance  of  the  factors  and  the  canonical  components 

rxs(0>  -  Etx^;]  -  mpp-y0  ♦  M£e 


is 


and,  in  the  particular  case  in  which  the  variance  of  all  the  measurement  errors  are  the 


4. 1  Formulation 


If  we  remove  the  restriction  that  the  parameter  matrices  of  the  ASHA  model  for  the 
factors  are  diagonal,  some  uniqueness  conditions  are  needed  to  determine  the  model.  One 
suitable  normalisation  is  to  impose  p'p  -  I,  that  means  that  of  all  the  C  matrices t 

*t  ■  ss-1at  ♦  st 

we  choose  C  such  as  (C^'F'PC-1  “  I.  Other  possible  selections  are  to  choose  £  to 
diagonalize  some  of  the  parameter  matrices  of  the  factor  model.  For  instance,  if 
follow  a  multivariate  ARC  1 ) ,  C  could  be  chosen  to  diagonalize  £  which  leads  back  to 
the  uncoupled  case.  However,  in  the  general  case  in  which  yt  follows  a  general  AIWA 
(Py>qy)  model,  the  condition  that  C  has  to  diagonalize  any  particular  parameter  matrix 
is  arbitrary.  Thus,  it  seems  more  natural  to  link  C  to  the  properties  of  the  factor 
loading  matrix. 


4.2  Properties  of  the  SMdel 

Table  4.1  summarizes  the  differences  between  the  basic  properties  of  the  uncouple 
factors  and  the  couple  or  dependent  factors  model.  The  main  difference  arises  related  to 
the  eigenvectors  of  the  covariance  and  autoregressive  parameter  matrices  that  are  no  longer 
columns  of  the  factor  loading  matrix.  Equations  (2.4)  and  (2.16)  do  hold  in  this  case  but 
the  conclusions  we  have  to  draw  from  them  are  different.  For  instance,  starting  with  (2.4) 


T  (k)  -  pr  (k)P'  k  >  1 
■z  “y  " 

and  assuming  that  T  (k)  -  O.D.u"1  where  D.  is  diagonal,  is  clear  that  £  (k)  has  the 

■y 

same  eigenvalues  than  T  (k)  but  with  eigenvectors  PU.  ,  V,  where  the  columns  of  V 

*y  ■  * 

belongs  to  the  null  space  of  PP ' .  To  see  this  write 


£,<*>  -  t£Sk  ■  v] 


D.  0 

-k  “ 


(TV 

-k  - 


0  0 

that  shows  clearly  the  eigenstructure  of  £B(h).  Of  course  if  T  (k)  is  diagonal  "  J 


and  we  obtain  the  uncoupled  case 


As  for  tha  autoregressive  parameter  matrices ,  the  condition  la 


and  assuming  that  g^(k)  haa  lineary  independent  eigenvectors  and  writing 
£y(k)  -  1 '  wh*r*  Sy  ia  diagonal  and  containa  tha  aiganvalaaa  of  |y(k). 

TABLE  4.1 


Property 

Uncouple  Factors 

Couple  Factors 

rank  <Iz<k))  -  r>  k  >  1 

Yes 

Yas 

r^tk)  symmetric 

Yes 

No 

eigenvectors  of  £z<k)  columns  of  j 

Yes 

No 

eigenvalues  of  T  (k)  eigenvalues  of  T  <k) 

mg  my 

Yes 

Yes 

rank  (P(i))  -  r 

Yes 

Yes 

st  ~  ARMA  <pz  -  py,  qz  -  maxlpy ,qy) ) 

Yas 

Yes 

rank  (JA)  -  r  i  >  1 

Yes 

Yas 

eigenvalues  of  ^(k)  are  eigenvalues  of  £^<k) 

Yes 

Yes 

eigenvectors  of  $2 (k)  are  eigenvectors  of  j^(k) 

Yas 

No 

l8(k)PSfk  -  pwkfk 

that  ahowa  that  the  eigenvaluee  of  ^  (k)  are  eigenvalues  of  |z<k)  aa  well,  with 
eigenvectora  Fj{k . 

To  build  the  canonical  tranaforaation  we  need  to  determine  the  null  apace  of  PP’ . 
Suppose  that  H  is  the  matrix  of  eigenvectora  of  the  first  autoregressive  parameter 
matrices  of  tha  observed  aeries  It .  Then,  r  columns  of  H  are  ,  whereas  the 
other  k  -  r  columns  are  arbitrary.  He  can  partition  H  and  H  1  as 

H  -  [PH,  R,]  H 

and  obtain  and  •  Th®  determination  of  these  columns  can  be  made  because  (a) 

r  is  the  rank  of  £z<k),  P(  1)  and  and  can  be  determined?  (b)  these  columns  are 


W  P' 

si  £ 


linked  to  the  eigenvalues  of  |^(1)  that  are  approximately  those  of  P(1).  Then  as 

(PW1)(W“1P* )  -  PP' 

the  null  space  of  ( PVj ^ )  ( W~ ^ P 1  )  is  the  same  as  that  of  gg*  and  can  be  obtained 

taking  the  eigenvectors  linked  to  zero  eigenvalues  of  this  matrix.  The  transformation 


will  produce,  as  the  uncoupled  case,  a  decomposition  of  the  vector  into  two 

components:  The  first  will  contain  the  factors  mixed  up  with  noise,  and  the  second 
component  will  be  a  vector  of  white  noise. 


Si  *t  +  Si  St 


St  *  BSt  - 


An  interesting  property  of  the  transformation  is  that  the  components  obtain  in  x1t 
will  be,  in  practice,  weekly  related.  For  example  if  yt  follows  an  AR(1),  will 

follow  an  AR( 1 )  process  too,  but  with  a  diagonal  autoregressive  parameter  matrix. 


80M1  PRACTICAL  CGMKRNT8  OH  THR  APPLICATIOH  OP  THR  THROAT 

Thera  are  thraa  Min  dacisiona  that  hava  to  ha  takan  to  apply  tha  following  thaory  to 
a  raal  data  aat  of  tiM  aerlea.  Firat ,  tha  nuabar  of  factor  ha a  to  ba  identified) 
second, tha  factor  loading  Mtrlx  £  ha  a  to  ba  estiMted,  and  third,  tha  dacoupling 
tranafonation  M  haa  to  ba  built.  Wo  will  briafly  coaiaant  on  thaaa  aubjacta  on  tha  baala 
of  our  axparianca. 

Tto  idantify  tha  nuabar  of  factora  wa  hava  four  diffarant  tool a  at  our  dlapoaalt  Tha 
covariance  Mtricaa,  tha  corralation  Mtricaa,  tha  partial  autocorralation  Mtricaa  and  tha 
|  paraMtar  Mtricaa.  Wa  hava  ah  own  that  tha  rank  of  all  thaaa  Mtricaa  ahould  ba  aqual 
to  tha  nuabar  of  ccaaaon  factora  driving  tha  ayataai. 

Howavar ,  whan  tha  varianca  of  ona  of  tha  faotora  ia  vary  largo  compared  to  tha 
variance  of  all  tha  other  factora,  tha  identification  of  tha  diMnaion  of  tha  factor  apace 
ualng  tha  rank  of  tha  covariance  and  corralation  Mtricaa  can  ba  alaleading.  Buppoaa  for 
lnatanca  that  tha  factora  follow  an  AR(1)  aodal ■  Than  if 


-  lyBJXt  -  at 


tha  covariance  of  v  will  ba 


'£y 

£/“  -  -  1)*J  »>1 

Buppoaa  that  Z*  ia  diagonal.  Than,  uaing  tha  apeetral  dacoatpoaition  of  £^(0) 

£y<0)  “  j,  "iVi 

whore  u1  are  tha  aigerrvaluaa  of  £^(0)  and  v^  ita  olganvactora.  Aa  £y(0)  ia 
diagonal,  u ^  are  alao  tha  varianca  of  tha  coaponenta,  and  if  tha  varianca  of  tha  firat 
factor  ia  much  larger  than  tha  variance  of  tha  othera 


Iy<°>  "  "voSi'OSf.O 


and  in  general 


£y(k)  "  ul.k*l,k*i,k  ( 

where  k  la  tha  largaat  aiganvalue  and  j1  ^  ita  aaaociatad  algonvoctor.  Than 


where  h  -  Pv  .  So,  all  the  covariance  matrices  will  be  dominated  by  a  large 

” '  /  j 

eigenvalue  and  will  have  rank  equal  to  one.  In  particular,  if  all  the  components  are 
nonstationary  and  -  I,  then  h1f0  ■  h1f)t  for  all  k  and  the  covariance  matrices  will 
display  the  structure  we  have  obtained  for  the  one  factor  model. 

However,  as  the  columns  of  P  are  eigenvectors  of  whatever  the  variance  of  the 

factors,  if  ”  J/  $z  will  still  have  at  least  r-roots  on  the  unit  circle  and  at 
least  r  eigenvalues  equal  to  one,  linked  to  the  P  vectors.  For  this  reason  the 
identification  of  the  number  of  factors  looking  only  at  the  rank  of  the  covariance  matrices 
could  be  misleading.  Note  that  this  problem  is  not  resolved  by  looking  at  the  correlation 
matrices  because,  calling  gz<k)  the  autocorrelation  matrix  of  order  k. 

Rz<k)  -  Dlag(rz(0))-t/2ri(k)Diag(riE(0))"1/2 

and  using  (5.1) 


R  * 
mz 


"i.kii.nM 


where 

h,  -  niag<£B(0>r1/2e1fJt  . 

and  the  same  situation  will  appear. 

Hie  rank  of  the  nonzero  partial  autocorrelation  matrices  can  also  be  misleading  when 
£^<0)  is  almost  singular.  Then  the  precision  of  the  computation  of  these  matrices  is 
very  low  and  the  determination  of  how  many  roots  of  the  partial  autocorrelation  matrix  can 
be  considered  equal  to  zero  is  not  easy.  As  a  general  rule,  the  simulations  we  have  made 
have  shown  that  estimated  eigenvalues  as  large  as  .5  can  appear  for  theoretically  zero 
values  in  the  partial  autocorrelation  matrices  when  the  variance  of  the  noise  is  small 
compared  to  the  variance  of  the  signal. 

The  rank  of  the  jj  matrices  seems  to  be  the  safest  way  to  determine  the  number  of 


factors.  When  there  is  disagreement  between  the  three  features  we  have  analyzed,  a 
conservative  strategy  is  to  start  assuming  a  small  number  of  factors  and  proceed 
iteratively,  building  the  components  and  analyzing  the  noise  for  further  structure. 


The  factor  loading  Matrix  |  should  ba  determined  froai  tha  eigenvectors  of  tba 
Maxi—  likelihood  aatiaator  of  tha  autoregressive  paraaatar  Matrix  Only  in  tha  ona 
factor  caaa  tha  coIqmzi  t  could  ba  obtalnad  diractly  froM  tha  covariance  or  partial 
autocorralation  sat rices.  A  wall  known  rule  in  Matrix  ooaputation  (aaa  Stewart  (1973))  ia 
that  tha  eigenvectors  of  a  Matrix  ara  wall-condition ad  only  whan  tha  Matrix  haa  distinct 
and  wall-aaparatad  eigenvalues ,  wharaaa  if  tha  Matrix  haa  rapaatad  or  oloatarad  eigenvalues 
tha  aatiaation  of  ita  aiganwaot ora  May  ba  il 1-condition ad.  This  mo  ana  that,  on  tha  ona 
hand,  wa  can  axpact  a  good  agreeawnt  in  tha  ona  factor  caaa  aaong  tha  diffarant  estimations 
that  wa  have  of  tha  wactor  of  factor  loadings)  on  tha  other  hand,  when  wa  have  several 
nonstationary  factors,  wa  hava  to  expect  thia  saaa  number  of  unit  roots  in  tha 
autoregressive  matrix  and  tha  aatiaation  of  tha  eigenvectors  is  not  expected  to  ba  vary 
accurate.  For  thia  reason,  even  if  tha  factors  ara  Independent,  wa  can  finish  with  factors 
that  have  ssuill  correlation  or  with  final  noise  processes  that  ara  not  exactly  white  noise 


processes • 


Figure  1  shows  ths  series  of  the  pries  of  wheat  in  five  provinces  of  Castillia,  Spain 
fro*  July  1880  to  Deceaber  1890.  TTierse  provinces  are  Avila,  Palencia,  Segovia,  Valladolid 
and  Zasora.  The  source  of  the  data  is  Sanchea-Albomos  (1975)  and  Vena  and  B&nchez 
Albornos  (1983)  have  studied  sosm  of  these  series  during  the  second  half  of  the  XZXth 
century,  pointing  out  their  lack  of  seasonality. 

The  analysis  that  follows  is  made  with  the  series  in  logs,  although  the  conclusion  are 
not  depending  on  the  particular  transformation  used. 

Table  5  shows  the  univariate  model  for  the  five  series.  These  models  are  broadly 
consistent  with  the  hypothesis  of  some  common  factors  following  an  AR( 1 )  model.  Table  6 
displays  the  eigenvalues  and  eigenvectors  of  the  covariance  matrices  up  to  lag  6.  There  is 
a  strong  eigenvalue  that  decreases  slowly  linked  to  a  fairly  stable  eigenvector.  The 
second  eigenvalue  is  rather  small  and  the  eigenvector  is  not  completely  stable.  Finally, 
the  other  three  eigenvectors  are  practically  aero.  Then,  it  Mams  there  are  at  least  one 
important  cooskhi  component  and  perhaps  a  second  one  although  much  less  important  than  the 


first 


Table  7  shows  the  eigenvalues  and  eigenvectors  of  the  first  two  partial  correlation 


matrices.  The  two  main  eigenvalues  of  the  covariance  matrices  appear  again  linked  to  the 
largest  and  third  eigenvalues  of  P( 1 )  and,  less  clearly,  in  the  two  negative  elements 
of  P(2).  Table  8  shows  the  pattern  of  significant  coefficients  in  the  exact  maximum 
likelihood  estimation  of  the  parameters  of  an  ARM A  (1,1).  It  can  he  concluded  that  there 
is  a  very  complex  relationship  among  the  series  with  all  kinds  of  feedback  present.  The 
eigenvalues  of  the  matrices  8,8  and  £  -  8  are  shown  in  Table  9.  The  five  eigenvalues 
of  *  are  close  to  the  unit  circle,  what  can  wrongly  lead  to  the  conclusion  that  there  are 
five  nonstationary  factors.  However,  the  rank  of  £  -  8  is  nearly  two,  leading  to  the 
conclusion  that  there  are  2  factors  in  the  system.  The  eigenvectors  linked  to  these  main 
eigenvalues  are  again  very  close  to  the  previous  eigenvectors  of  the  covariance  and  partial 
correlation  matrices,  and  these  two  eigenvectors  appear  approximately  in  the  ♦  and  8 
matrices,  as  shown. 

In  summary,  there  is  strong  evidence  of  at  least  one,  and  probably  two,  common  factors 
driving  the  system.  The  first  is  linked  to  an  eigenvalue  that  is  broadly  speaking  a  mean 
of  the  series.  The  second  is  roughly  the  ratio  among  Avila  and  Zamora.  This  result  makes 
sense  because  this  second  factor  affects  the  two  more  western  provinces  that  are  closer  to 
the  Portugal  border. 


TABLE  7 


Elgenstructure  of  the  Partial  Correlation  Matrices 


.95 

.63 

.69 

.41 

.04 

.37 

.69 

-.94 

.51 

.02 

.49 

-.16 

2.79 

-.16 

.05 

.53 

-.05 

1.26 

-.20 

-.93 

.48 

-.31 

1.96 

-.16 

.27 

.57 

-.74 

4.94 

.90 

-.24 

P<1) 


-.47 

-.09 

.41 

.17 

.20 

.58 

.77 

.04 

.23 

.20 

.32 

-.23 

.51 

-.36 

-.77 

.48 

-.59 

-.53 

.95 

.43 

.38 

-.05 

-.34 

-.37 

-.  14 

.43 

.16 

.67 

.38 

1.40 

P  (2 ) 


-36- 


TABU  8 

Significant  Values  In  the  Parameter  Matrices 


TABLE  9 

Eigenvalues  of  the  Parameter  Matrices  and  Main  Eigenvector 


£-01  1.46 


.99(1) 

.98+. 081 

.98-. 081 

.84+. 141 

.84' 

-.51<U> 

.51(2a) 

.98+. 181 

.98-. 181 

1.46(1*> 

.34<2a) 

.14 

-.05+. 031 

-.05- 

.85 


We  have  chosen  the  eigenvectors  (.37  .49  .53  .48  .57)  and  (.69  000  -.74 
to  build  the  transformation.  Any  other  election  does  not  seem  to  change  the  results.  Wi 
this  two  generators,  the  matrix  g  turns  out  to  be 


52 

.07 

-.51 

-.48 

.49 

26 

-.88 

.09 

.31 

.24 

04 

-.15 

.70 

-.70 

.03 

41 

.41 

.45 

.41 

.38 

74 

.07 

.07 

.07 

-.66 

The  resulting  five  series  are  plotted  In  Figure  2.  The  new  £x  and  8^  for  the 
transformed  x  eerlee  obtained  transforming  the  parameter  matrices  of  z  turn  out  to  be 


~  V".1  ~  !■ 


i  i  n  m  hi 


M|M 


,-1 


M0M 


-1 


e  e 

.96 

. 

e  e 

• 

• 

.98 

s 

+ 

• 

.16 

.  -.71 

.94 

.18 

-.30 

.82 

e  e 

.92 

.25 

e  • 

.90 

.19 

.93 

.16 

e  e 

-.55 

.12 

.87 

e  e 

.20 

.50 

.33 

indicate 

a  parameter  value  below 

.1. 

-.19 

.84 

-.24 

-.27 


.15 

-.23 

1.0 

-.12 

-.19 


that  there  is  a  matrix  A  of  rank  equal  to  three  that  can  be  subtracted  from  both  sides. 
All  the  rows  of  this  matrix  are  orthogonal  to  the  vectors  (00010)  and  (00001)  and  so, 
according  to  Theorem  2,  could  be  subtracted  to  obtain  three  independent  sequences  of  white 
noise  plus  2  factors.  The  first  factor  is,  according  to  the  fourth  raw  of  M,  a  mean  of 
all  the  series  and  describes  the  general  trend  of  the  market.  The  second  factor  is 
stationary  and  affects  to  the  most  westerly  provinces,  Avila  and  Zamora,  and  although 
stationary  seems  to  have  an  ARMA  (1,1)  structure.  The  factors  are  not  completely 
uncorrelated  and  a  weak  relationahip  seems  to  exits  between  them. 

To  check  all  these  results  we  have  built  a  model  for  the  factors  from  scratch.  Table 
10  shows  the  parameters  of  the  model  and  Table  11  the  correlation  and  partial  correlations 
of  the  residual.  The  first  two  component  of  the  decomposition  are  stationary  AR(1)  process 
uncorrelated  at  all  lags  and  the  third  is  white  noise.  The  fourth  is  the  nonstationary 
component  that  describes  the  general  evolution  of  the  market  and  the  fourth  is  the 
differential  effect  of  the  occidental  provinces.  This  factor  turns  out  to  be  uncorrelated 
with  the  main  trend. 

The  results  are  sensible  and  confirms  that  the  five  series  are  driven  by  a  main  factor 
that  explains  their  nonstationary  behavior.  The  strongest  correlation  of  the  main  factor 
and  the  five  series  is  with  Valladolid  r  »  .99  that  was  the  greatest  producer  of  wheat  in 


Castillia  and  is  consistent  with  the  expected  behavior  "a  priori 


TABLE  10 


-.74  .95 

(.12)  (.02) 


.  .  .  -.50 

(.06) 


TABLE  11 


1 

.12  1 

.49  .28  1 

-.04  .08  .24  1 


10.34 

.59 

4.54 

-4.40 

.76 

9.21 

.08 

5.21 

4.34 

25.25 

2.53 

-.25 

.71 

3.79 

Parameter!  of  the  transformed  aeriea  and  correlation  among  reaiduala. 


As  a  further  check  of  the  methodology  we  have  analysed  the  wheat  price  aeriea  of  three 
province!  of  different  econoaiic  behavior  in  which  a  unique  conun  factor  is  not  expected. 
The  provinces  we  have  chosen  are  La  Ooruna,  that  is  not  wheat  producer  and  ia  the  North  of 
Spain,  Zaragoza,  that  although  producer  of  wheat  supplied  to  Aragon  and  Cataluna,  and 
Valladolid  again  aa  representative  of  Castillia.  Table  12  displays  the  eigenvalues  of  the 
covariance  matrices  and  although  there  is  a  large  eigenvalue  linked  to  the  general  mean, 
the  other  two  eigenvalues  are  fairly  stable.  The  partial  autocorrelation  coefficients  show 
3  strong  roots  and  the  estimation  of  the  model  in  Table  13  shows  that  $  -  0  has  rank 
three  and  it  is  not  possible  to  reduce  the  dimension  of  the  system.  There  are  three 
factors,  and  the  three  are  nonstationary  as  expected. 


TABLE  12 


Eigenvalue*  and  Eigenvector*  of  the  Covariance 
Matricea  of  La  Coruna,  Valladollid  and  Zaragoca 


lag 

1 

3 

4 

5 

6 

A1 

486 

479 

459 

441 

422 

402 

.42 

.43 

.44 

.46 

.47 

.48 

.49 

.47 

.45 

.43 

.40 

.38 

.76 

.77 

.78 

.76 

.79 

.79 

1 

2 

3 

4 

5 

6 

X2  50 

47 

43 

43 

41 

39 

.64 

.64 

.66 

.66 

.68 

.66 

-.75 

-.75 

-.74 

-.74 

-.73 

-.75 

.18 

.19 

.19 

.20 

.20 

.22 

1 

2 

3 

4 

5 

6 

X3  9 

7 

4 

5 

4 

3 

.67 

.68 

.69 

.68 

.68 

.68 

.39 

.37 

.37 

.38 

.37 

.37 

-.63 

-.62 

-.63 

-.63 

-.63 

-.63 

TABLE  13 


Eigenvalue*  of  the  Parameter  Matricea  of  the  Model 
for  La  Coruna,  Valladolid  and  Zaragoza 


\mm 

mnRn 

mm 

1  .92  .73 

i  •  a  - 

-.6  -1.5  1 

1  2  0 

0  1.0  0 

nz 

K9HHE9I 

■ 

1  .78  -4.5 

0  .70  0 

0  .71  1.41 

6.2  The  Berlea  of  Stock  Price*  Index** 

The  three  eerlea  studied  are  monthly  average*  of  the  Dow  JOnea  Index  of  30  Industrial 
Stocks  (DJ  series),  the  Stfindurd  and  Boor  Industrial  Stock  Index  (ID  series),  that  Includes 
400  stocks,  and  the  Standard  and  poor  Stock  nice  Index  (SP  series)  that  Includes  500 
companies  chosen  from  all  sector*  of  the  0.8.  economy,  The  sample  period  Is  from  January 
1966  to  December  1981.  Figure  3  shows  th*  evolution  of  the  three  series  that  have  been 
standarlzed  removing  their  means  and  dividing  by  their  atandard  deviation  to  show  the  three 
in  a  common  scale.  Dtbl*  4  shows  th*  univariate  models. 

we  will  report  her*  th*  analysis  on  th*  logarithm  of  th*  series  although  the 
conclusion  are  robust  to  th*  scale  of  meaaurement.  Table  IS  displays  th*  eigenstructure  of 
the  covariance  matrices  and,  as  the  spread  of  Dow-Jones  is  quite  different  from  the  other 
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two,  the  eigenvalues  and  eigenvectors  of  the  correlation  matrices  ars  also  presented.  loth 
aat  of  matrices  ahow  thraa  wary  atabla  eigenvectors.  The  eigenvector  link ad  to  tha 
greatest  elganvalos  indicates  that  tha  flrat  factor  la  a  weighted  aean  of  the  three 
series.  Thus,  it  can  ha  interpreted  aa  the  general  trend  of  tha  atoek  market,  the  second 
factor  aeeaia  to  taka  into  account  tha  behavior  of  large  indnatrial  corporations  (as 
measured  by  the  Daw  donas  Index)  with  respect  to  tha  raat  of  tha  market,  the  third 
eigenvalue  is  very  small  and,  consequently,  tha  system  asses  to  be  driven  by  two  factors. 
Tbs  third  eigenvalue  represents  a  constant  relationship  among  the  three  aeries  in  the  whole 
saeple  period,  this  relationship  is 

.66  in  ft  +  .09  in 

LB  LB 

and  is  a  weighted  combination  of  the  performance  of  the  industrial  sector  (defined  by  LX 
and  LD)  with  respect  to  the  whole  of  the  market  (Lg).  the  fact  that  the  eigenvalue  linked 
to  this  eigenvector  is  equal  to  aero  me ana  that  this  relationship  has  bean  stable  in  the 
stock  market. 

Identification  suggests  a  multivariate  kWh  (1,1)  model  for  the  vector  of  the  three 
indexes.  When  fitted  by  maximum  likelihood,  and  after  deleting  the  nonsl9>lficant 
parameters,  model  A  of  Table  16  is  Obtained.  The  2  parameter  matrix  has  the  remarkeabla 
property  that  rows  one  and  two  are  nearly  identical  and  the  rank  of  6  is  two.  Actually, 
the  2  Matrix  could  be  decomposed  as  the  addition  of 

mm  14 

univariate  Models  for  Stocks  Serlss 


LX  •  An  of  industrial  Standard  and  Poor  Index*  Lt  -  in  of  Standard  and  Poor  Index* 

ID  *  An  of  Dow  Jones  Index.  0(A)  is  the  Lunj-lox  statistic  with  A  degrees  of  freedom. 


V  tj.fj. 


a  matrix  similar  to  the  0  matrix  of  model  B  (see  Table  16B)  plua  a  matrix  with  the  three 
rowa  equal  to  (2.7  -3.8  .92).  Calling  this  rank  one  matrix  A,  this  implies 

*  2 

that  means  that  a  noise  sequence  can  be  expressed  as  a  linear  combination  of  the  other 

two. 

An  interesting  fact  is  that  the  vector  that  defines  this  combination  is, 
approximately,  proportional  to  the  eigenvalue  linked  to  the  smallest  eigenvalue  in  the 
covariance  and  autocorrelation  matrices.  Thus,  the  same  restriction  affects  to  both  the 
noises  and  the  vector  of  series.  Calling  m'  this  common  vector  we  have 

m'r.  *  0 

-  -t 

m'a_  •  0  . 

-  -t 


For  these  results  to  be  in  agreement  with  the  model  for  the  series,  as 


which  means  that  9  oust  be  equal  to  kj,  where  k  is  a  scalar.  The  matrix  £  of  model 
B  suggest  that  this  can  be  a  reasonable  model.  The  maximum  likelihood  estimation  with  the 
restriction  that  the  matrix  2  be  diagonal  leads  to  model  C.  Mote  that  the  fit  of  the 
model  as  measured  by  the  diagonal  elements  of  has  not  worsened,  and  the  diagnostic 

checks  does  not  indicate  any  problems  in  the  estimated  residuals.  The  three  diagonal 
elements  of  0  in  model  C  are,  approximately,  equal  and  so,  the  final  model  for  the  series 
is: 

V;t  “(It  ,22BI)at  . 

As  the  $  matrix  is  the  identity  it  does  not  provide  information  about  eigenvectors 
in  this  case.  However,  we  noticed  that  (a)  the  model  for  the  common  factors  mist  be 
Vyt  »  at  because  the  eigenvalues  of  0,  that  are  all  equal  to  one,  must  match  the 
eigenvalues  of  the  factors,  (b)  In  this  particular  case  the  factors  are  undetermined 
because  any  orthogonal  transformation  of  yt  will  produce  exactly  the  same  model  for  the 
factors,  (c)  we  can  choose  the  orthogonal  transformation  in  order  to  result  in  orthogonal 
columns  in  the  vector  £,  (d)  If  we  do  that,  the  columns  of  |  will  be  eigenvectors  of 

the  covariance  matrices. 

Choosing  this  representation,  we  first  notice  that  the  two  main  eigenvectors  of  the 
covariance  matrices  (Table  15)  are  orthogonal.  Then  taking  these  eigenvectors  as 
generators  of  the  transformation  the  matrix  IJ 

.68  .64  .37 

{J  -  .32  .18  -.92 

+.66  -.75  +.08 

This  produces  three  new  aggregates  x.  «  My.  .  The  first  could  represent  the  general 
tendency  of  the  stock  market.  The  second  could  be  interpreted  as  the  behavior  of  large 
corporation  versus  the  rest  of  the  market .  The  third  is  the  ratio  of  the  industrial  sector 
versus  the  general  market.  This  third  component  is  approximately  the  linear  combination  of 


th«  Ntl*i  that  has  a  variance  vary  close  to  aero.  These  three  coupon ante  are  shown  In 


rigors  4.  The  variance  of  the  first  Is  the  largest  eigenvalue  of  r„  (.0440)  the  variance 
of  the  second  less  than  7%  of  the  first  (.0030)  and  the  third  has  a  variance  that  ie 
0,0023%  of  the  variance  of  the  Mean  factor  and  could  be  considered  nearly  constant. 

However,  when  we  apply  the  transformation  N  to  the  matrices  |  and  J,  since 
£  •  1  and  ^  *  ~. 221  these  parameter  matrices  do  not  change.  Then,  the  vector 
follows  the  same  model  as  t^,  which  means  that  the  third  component  of  although 
almost  saro.  Is  a  nonstationary  factor.  This  factor  is  displayed  in  Figure  5  In  a  very 
augmented  soale  and  in  fact  it  Is  clear  that  it  la  nonstationary.  It  increases  slowly  but 
steadily  until  1974,  suggesting  that  the  stock  prices  of  Industrial  corporations  were 
gaining  against  the  general  market  until  1974.  Then,  this  tendency  broke  down  and  industry 
began  loosing  steadily  position  against  the  general  market.  This  component  nay  indicate  a 
relative  deterioration  of  O.B.  industry  after  the  74  oil  crisis. 

The  conclusion  from  this  example  is  that  we  need  to  carry  out  the  complete  analysis  to 
Obtain  a  clear  picture  of  the  system.  As  we  stressed  in  section  5,  if  the  variance  of  one 
factor  ie  very  smell  in  relation  to  all  the  others,  this  factor  nay  be  overlooked  in  the 
etudy  of  the  rank  of  the  covariance  and  partial  autocorrelation  matrices.  However,  it  will 
show  up  in  the  final  multivariate  ARIMA  model  and  will  be  easily  detected  when  applying  the 
transformation  to  recover  the  factors. 
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7,  COHPAglSOH  WITH  OTHER  APPROACHES 

Quenoulli  (1957)  suggested  to  use  the  eigenvectors  and  eigenvalues  of  ^u>  £0 ( *)  to 
build  transformation*  that  allow  aiaplar  intarpratation  of  tha  fittad  modal  and/or  to 
identify  tha  poaaibla  number  of  factors.  We  now  revise  briefly  tha  properties  of  those 
transformations. 

Starting  with  the  simplest  case,  the  one  factor  model  it  has  been  shown  that  both 
matrices  have  the  representation 

*iB'  +  ie 

and  so  ( 1 )  procedures  based  on  either  the  rank  or  the  eigenvalues  of  ^  will  be  similar 

to  those  based  on  these  same  properties  of  rfl(s)»  (2)  both  matrices  are  full  rank 

matrices  if  is  not  degenerate;  (3)  the  eigenvectors  of  either  or  £^(0)  will 

2 

only  coincide  with  those  of  PP*  when  E  -  c  I.  It  turns  out  then  that  in  the  on*  factor 
*  mm  ■€  E" 

2  2 

case  only  when  Ee  •  C£ I  and  cg  is  vsry  small  compared  to  Yg(y)  (see  Section  5) 
principal  components  based  on  either  r^(0 )  or  will  obtain  the  same  results  as  the 

method  suggested  in  this  paper. 

In  the  multifactor  situation,  the  eigenvectors  of  £Q(s)  and  ^  needn't  be  the  seme 

but  the  above  consideratlona  still  apply.  As  T  (0)  -  PT  (O)P'  +  E  principal  components 

*  — y  “ 

in  the  original  data  will  depend  on  the  measurement  error  matrix  Eg,  and  the  same  is  true 
for  the  matrix  . 

Box  and  Tlao  (1977)  suggested  the  use  of  the  eigenvectors  of  the  matrix 

Q  -  r"1 (0)£2(0)  (7.1) 

■*  *  p 

where  it  is  assumed  that  follows  an  autoregressive  process  £t  “  ).  ♦  a^.,  and 

P 

£;<o>  -  l  !»£.(*> 

f-1 

and  they  proved  that  a  transformation  based  on  the  eigenvectors  of  Q  produces  a  set  of 
new  canonical  variables  that  are  ordered  from  least  to  most  prsdictable.  to  compare  this 
canonical  transformation  wih  the  one  recomnended  in  this  paper  let  us  assume  first,  that 
the  model  equetion  for  the  factors  is  AK(1)  and  second,  that  the  model  identified  and 


fitted  for  tho  observed  series  is  ar( 1 ) .  Than  tha  estimation  of  the  autoregressive 


paraaatar  will  ba 

*.  •  £;(iO°>  “  ^(DrcKyWir  ♦  s,)-1 

and  uaing  tha  expression  for  tha  firat  partial  autoregression  matrix  obtained  in  (2.6),  and 
inserting  it  into  (7.1), 

8  - 

if  m  now  apply  that 

+  tLW'Cl  "  +  £0<y,4) 

whera 

ft  -  I'S*S 

tha  general  axpraaaion  for  Q  lai 

8  -  ‘£*  ♦  S£o<y,S,r12ty£y(OHi  +  Ky*0”"1^0  V 

and  using  tha  wall  known  axpraaaion  for  tha  inversion  of  tha  sun  of  two  matrices  one  of 
than  being  nonsingular, 

a  -  -  £Vi + vo,*rVo,*,^1,“yyo>t* *  ftvo,r1£y(o,M' 


and,  after  aana  manipulations  and  taking  into  account  that 

1  -  (X  +  H)“*H  -  (I  +  H)”1 


wa  finally  obtain 

8  -  SVi  +  *  ft£y  < ®>  ^ftty « •  >tT  «?.2> 

Then,  if  2'  ia  such  that  j'j  -  j,  than 

gj  *  S*S'X  “  2 

and  so  tha  columns  of  V  are  eigenvectors  of  g  linked  to  the  sero  eigenvaluea  of  g. 
However,  expression  (7.2)  shows  that  neither  £  nor  gg  whan  g  is  an  orthogonal  matrix, 
will  be  in  general  eigenvectors  of  g  and  so  tho  canonical  transformation  will  fail  to 
recuperate  tha  factors.  The  reason  is  that  tha  linear  oombinationa  of  greater 
predictability  depends  not  only  on  the  factors,  but  on  tha  variance  of  the  signal  as  wall. 


I 


In  the  one  factor  model,  equation  (7.2)  reduce*  to 

o  -  eT’pp' *X. 

■  •£  —  Q 

with 

x  aY0(y>*v 
C  (1  +  ar0(y))2 

and  a  «  P'E_1p  as  before.  Then  (  'p  is  the  vector  linked  to  the  nonzero  eigenvalue  of 
-  ■£  -  “6  - 

2  that  is  aXg.  As  approaches  the  unity,  if  aYQ(y)  is  much  greater  than  one,  X^ 

will  approach  unity  too,  in  agreement  with  Box  and  Tiao  (1977)  theorem.  In  addition,  the 
eigenvalues  linked  to  zero  eigenvectors  of  2  roust  be  linear  combinations  of  those  of 
PP*  and,  calling  the  matrix  whose  rows  are  the  eigenvectors  of  g,  JT'J  • 

Therefore  both  procedures  will  come  out  with  a  different  component  plus  k  -  1  white 
noise.  In  the  canonical  transformation  this  single  component  depends  on  £  and  on  ^ 
whereas  the  factor  analysis  decomposition  depends  only  on  P. 
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8.  CONCLUSION 

It  has  b««n  shown  that  the  multiveriete  tl«s  series  model  for  an  observed  vector  of 
tine  series  may  appear  vary  complicated,  when  in  fact  a  simple  representation  using  a  small 
set  of  common  factor  is  adequate.  Besides,  failing  to  notice  this  structure  can  be 
dangerous,  because  the  interpretation  of  the  Internal  dynamic  that  we  have  to  draw  from  the 
model  will  be  completely  different  in  both  cases. 

Onlike  in  the  factor  analysis  of  static  variables,  the  restrictions  that  the  presence 
of  common  factors  put  on  the  covarianoe,  correlation,  partial  autocorrelation  and  parameter 
matrices  make  the  identification  of  the  number  of  factors  and  the  estimation  of  the  factor 
loading  matrix  relatively  easy. 

Box  and  Tiao  (1977)  and  Tiao  and  box  (1981)  recommend  the  computation  of  eigenvalues 
and  eigenvectors  of  different  matrices  as  a  useful  source  of  valuable  Information.  It  has 
been  shown  the  relationship  between  this  eigenvalues  and  eigenvectors  in  the  context  of  a 
common  number  of  factors  and  how  a  transformation  can  be  built  to  obtain  a  parsimonious  and 
intelligible  representation  of  the  observed  vector  of  time  series. 
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